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1. Introduction 


This paper investigates the welfare consequences of strategic behaviour under approval voting (AV) by comparing utilitarian 
efficiencies obtained with Expected Utility maximising voting behaviour (EU behaviour) and with Sincere Voting behaviour (SV 
behaviour). Under SV behaviour voters are assumed to vote for all those candidates for which the utility exceeds the average for all 
candidates (Merrill, 1979; Brams and Fishburn, 1983, p. 85; Ballester and Rey-Biel, 2007). Under EU behaviour voters give their 
votes to different candidates depending on expected-utility calculations (Merrill, 1981a,b). They give a vote to a candidate under EU 
behaviour if the expected gain from doing so is positive. The difference between EU and SV behaviour under AV is thus that the 
voters are engaged in probability calculations in the former but not in the latter (see e.g., Niemi, 1984). 

A voter is usually defined as voting sincerely under AV if he or she gives a vote to all candidates standing higher in his or her 
ranking than the lowest-ranking candidate for whom he or she gives a vote. There are no ‘holes’ in a voter's approval set. If 
insincere voting is defined as not voting sincerely, it is commonly considered to be rare under AV. In contrast, strategic behaviour is 
not rare under AV, and the focus of this paper is on the welfare consequences of such behaviour rather than insincere or ‘strategic’ 
voting. The welfare consequences of strategic voting under AV are thus not studied here, if it is defined by the fact that a voter gives 
his or her vote to a candidate who is lower in his or her ranking than some candidate for whom he or she does not vote (see e.g., 
Brams and Sanver, 2006). 


* Tel.: +358 919 129 284. 

E-mail address: aki.lehtinen@helsinki.fi. 
1 although Brams and Fishburn (1983, p. 85) use an expected-utility terminology, their mean utility rule is classified as sincere here. 
? See eg., Brams and Fishburn (1978, 1983, p. 29) and Brams and Sanver (2006). 
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Table 1 
Voters types and utilities 


Type of voters 


ty ity fin fi, tis te U! 
x Y z x y Z Ui 
y Z x Zz x y U, 
Z xe y Zz x US 


Utilitarian efficiency is defined as the percentage of simulated voting games in which the candidate that maximises the sum 
of voters’ utilities (the utilitarian winner) is selected (e.g., Merrill, 1988). The main finding is that utilitarian efficiencies are high 
under AV irrespective of the behavioural assumption used. Furthermore, given that the aim is to also evaluate whether it might be 
reasonable to introduce AV in mass elections, these efficiencies are compared to those under plurality voting (PV). It is shown that 
utilitarian efficiencies are higher under AV than under PV. Indeed, utilitarian efficiencies are higher than under any voting rule that 
has been studied with similar methods (see Lehtinen, 2006b, 2007a,b). 

Brams and Fishburn (e.g., 1983, 2005) have presented various arguments for AV. I will try to clarify or modify at least two of 
them. One argument is that this rule takes information on preference intensities into account (Brams et al., 1988). AV differs from 
other commonly used voting rules in that it allows for expressing intensity information even with SV behaviour, and in that voters 
never need to abandon their most-preferred candidate when they engage in strategic behaviour (e.g., Brams and Fishburn, 2005). 
However, thus far this intensity argument has been based on the mere intuition that since approval information is closely related to 
intensity information, it may be expressed under AV. This paper provides a formal model with which this question can be explicitly 
studied. The findings provide confirmation that this argument is correct under both behavioural assumptions. On the other hand, 
they imply that given a utilitarian evaluation of outcomes, the beneficial features of AV do not depend on the somewhat 
questionable assumption that voters have ‘dichotomous’ preferences (Brams and Fishburn, 1983, Ch. 2-3): AV is the best rule in 
terms of reflecting preference intensities.’ 

It has also been claimed that AV makes strategic voting unnecessary (Brams and Fishburn, 1978), but the welfare effects of 
strategic behaviour are still considered somewhat controversial. As (Niemi, 1984) has argued (see also van Newenhizen and Saari, 
1988a,b), even though strategic voting may be rare under AV, even sincere voting may require a considerable amount of strategic 
thinking under this rule. The findings reported here show that whether or not voters engage in strategic calculations, AV yields 
high utilitarian efficiencies and thus often selects candidates with broad public appeal (cf. Brams and Fishburn, 1983, pp. 135, 171). 

Strategic voting increases utilitarian efficiency under various voting rules because it allows for expressing preference intensities 
(Lehtinen, 2006b, 2007a,b). These results depend on the counterbalancing of strategic votes: broadly accepted candidates are likely 
to obtain many strategic votes and to lose few. Given that the decision to give more than one vote under AV is analogous to giving a 
strategic vote under other voting rules, there is also counterbalancing under AV. Counterbalancing of second votes implies that 
broadly supported candidates obtain more second votes than candidates facing strong opposition. This explains why utilitarian 
efficiencies under EU behaviour are also high under this rule. 

Voters’ beliefs are derived by combining methods of computing pivot probabilities (Hoffman, 1982; Cranor, 1996) with a signal- 
extraction model similar to the one provided by Lehtinen (2006a), and to global games (Carlsson and van Damme, 1993; Morris and 
Shin, 2003). Voters obtain noisy signals of the true structure of the game and formulate beliefs on the basis of that. 

Computer simulations are used for deriving the results because there is a large number of agents that are heterogeneous both 
with respect to their beliefs and to their preferences, and the aggregate-level outcomes depend on the voting interaction. 
Aggregating individual votes in an analytical model would be very difficult.* 

The structure of the paper is the following. Optimal strategies under approval voting are formulated in Section 2. Section 3 
introduces the model of incomplete information by describing the assumptions about voters’ signals and beliefs. Section 4 
describes the computer simulations. The simulation results are presented in Section 5. Section 6 shows that the results are robust 
with respect to different interpersonal comparisons of utilities. Section 7 presents the conclusions. 


2. Strategic behaviour under approval and plurality voting 


Let X={x,y,z} denote the set of candidates (with generic members j, k and m). The six possible types of voters and their 
preference orderings are presented in Table 1 below. Ui, denotes voter i's payoff for the kth best candidate. 

Under AV, voters give a vote to any number of candidates. Let N=2001 denote the total number of voters, and let nj denote the 
number of voters who prefer candidate j the most. Given that the aim is to study mass elections, the maximum computationally 
feasible number of voters was selected. Changing the size of the electorate will be briefly considered later in discussing the 
simulation results. 


Let nj’ denote the number of votes candidate j obtains under sincere behaviour under AV, and let n“Y denote the total number 


of votes cast under AV. Let v;”, vy”, and vY denote the vote shares of candidates x, y and z if all voters vote sincerely under PV: 


3 This result seems to imply that modifying AV is unnecessary (see Yilmaz, 1999). 
4 See Lehtinen and Kuorikoski (2007) for a discussion on simulations in economics. 


690 A. Lehtinen / European Journal of Political Economy 24 (2008) 688-704 


AV 


vPV — 3, and let ve’, vyY, and v3" denote similar vote shares under AV (vAV — a). Let pi,’’” =prob(v;”=vi" > Vn) denote the probability 
that voter i will be decisive in creating or breaking a first-place tie between j and k under PV, i.e. a pivot probability. pix“” denotes similar 


probabilities under AV. The expected gain in utility associated with voting for candidate j under AV is (Merrill, 1981b; Carter 1990) 
E = Spy’ [vi - U'(w)]. (1) 
ek 


If the superscripts denoting the individual voter and the voting rule are dropped, the expressions for expected gain for the three 
candidates are thus 


Ex = Pxy(U(x) — U(y)) + Pxe(U(x) — U(z)), (2) 

Ey = Pxy(U(y) — U(X)) + Pyz(U(y) — U(Z)), (3) 
and 

E, = Pxz(U(Z) — U(X)) + Pyz(U(Z) — U(y)). (4) 


Voters give a vote to a candidate if the expected gain from doing so is larger than zero (Merrill, 1981b; Carter, 1990). The 
conditions for strategic voting under PV can also be deduced from these equations once pie” V are replaced with Pie” V see McKelvey 


and Ordeshook (1972). A voter votes for the candidate who offers the highest expected gain. 
3. A signal extraction model for the pivot probabilities 


Voters will always give a vote to their most preferred candidate under approval voting (Brams and Fishburn, 1978). However, if 
all three pivot probabilities are exactly zero, all expected gains are zero. In such cases a voter is assumed to give a vote only to his or 
her most preferred candidate. One might argue that the voter has no incentive to vote in such a case. This complication is ignored 
here because the present model is not intended for modelling turnout. As the size of the electorate increases, the absolute pivot 
probabilities become infinitesimally small. However, people also vote and behave strategically in mass elections. How can this be, if 
all pivot probabilities are exactly zero? To account for such behaviour, it is important to see that even if the absolute pivot 
probabilities are zero, the relative probabilities of breaking a tie between different candidates may not be the same. After all, 
usually some candidates are higher in the polls than some others. 

Furthermore, since the voters are uncertain about the winning chances of the various candidates, and this uncertainty can be 
modeled with continuous distributions, it is possible to give a reasonable formal account of these relative probabilities. | will do so 
by way of a signal extraction model. As Myatt (2002a,b, 2007) and Fisher (Myatt and Fisher, 2002a,b) have argued, what is relevant 
for voters’ decisions are the relative rather than the absolute tie-probabilities. 

In real-world elections, voters’ signals and beliefs are based on polls, television broadcasts, and conversations with friends. It is 
assumed here that all these sources of information are being modelled by the signals explained below. 

Let v; denote a generic vote share. Voters obtain perturbed signals about vote shares: 


Sx = Vx + pi, (5) 

Sy = Vy + pRi, (6) 
and 

8S, =Vvz+ pRi, (7) 


where R; denotes a standard normal random variable, and p is a scaling factor that reflects the reliability of the signals (0 =[0.001, 
0.013])°. Since these perturbed vote shares must sum to one, they are rescaled as follows: 


5s (8) 


a ZkexSk 


The outcome space for a three-candidate election can be represented visually as a barycentric coordinate system — an 
equilateral triangle on the three-dimensional plane v,+v,+vz=1 (see Fig. 1). The point s=(s,,5,,sz) represents a voter's perturbed 
signal or ‘observation’. 


> The reason for this range will be explained in Section 5.1. 
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Fig. 1. Predicted vote shares in barycentric coordinates. 


Black (1978) calculated pivot probabilities by evaluating the distance between the point s and the line QT, which represents a tie 
between candidates x andz(Q = (4,4,0), and T = (§,4,4)). Hoffman (1982) showed how to take into account the voters’ degree of 
confidence by constructing a normal distribution around point s, see Fig. 2. 


Deriving pivot probabilities implies computing the integral 
Pie = J SAjeF (V1, V2, V3)). (9) 


Hoffman illustrates his method for a three-candidate (he labels the candidates 1, 2, and 3) election. As shown in Fig. 2, Hoffman 
specifies the region Aj, as the portion of the outcome triangle in which candidate j loses to candidate k by one vote. He defines the 
pivot probability as “the probability that the election result (vj,v2,v3) lies in the region Aj.” Thus pj, can be expressed as: 


Pir = J Sa,,F(P), (10) 


where D is the distance from the predicted outcome point to a point in the region Aj.. These volume calculations can be simplified 
by observing that each region Aj, is very narrow, only one vote wide. P,, can be approximated by using Simpson's rule or another 


F(¥,, ¥2s ¥3)) 


a 


¥3 


Wy=¥> ¥, 


Ay 


¥y 


Fig. 2. Hoffman's three-candidate election. 
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Smin Sy Ip Smax 


Fig. 3. Predicted vote shares. 


numerical integration technique to integrate over the outcome points for which v;=v,>Vm. Thus pj, is roughly proportional to the 
area of the face of pj, and we will only need the p;, ratios (Hoffman, 1982, p. 753). Hoffman's method thus provides the needed 
relative probabilities. 

Cranor (1996) observed that the only projected outcomes that are crucial to the pivot-probability calculation are the expected 
vote share of the candidate who is predicted to win, and the expected shares of the two candidates for whom the pivot probability 
is being computed. Let s,,3. denote the predicted vote share of the candidate who is expected to obtain the most votes, and let 
Smin(j,k) denote the predicted vote share of j or k, whichever is predicted to receive fewer votes. In calculating pj, using Cranor's 
method, first the predicted vote shares for the candidates are plotted on a line representing these shares, and then normal- 
distribution curves are constructed around each outcome point, as shown in Fig. 3. 

In this case x is predicted to be the winner and z the least popular candidate. The shaded area in Fig. 3, where the curves 
intersect, represents the relative probability p,, that the final outcome will involve a tie between candidates x and z. These 
probabilities are high when the predicted vote shares of two candidates are close to each other. Fig. 3 shows that the relative 
probability between x and z is smaller than that between x and y because the now shaded area is smaller than the area (which is not 
separately shaded) under the intersection of densities f, and fy. 

When one of the two candidates for whom a pivot probability is calculated is predicted to win, it is sufficient to draw just the 
two curves. However, in order to derive a pivot probability for two candidates who are not predicted to win (here, y and z), the 
outcome point for the predicted winner needs to be plotted, and a normal curve is constructed around it. The pivot probability is 
then the intersection of all three curves. Calculating the area under densities f, and f, does not thus provide a correct measure of 
the winning pivot probability, even though it correctly takes into account the relative chances of y and z. However, the 
intersection of all three curves provides an adequate measure because it takes into account the fact that x is the predicted winner 
in computing the winning pivot probability between y and z. Note that because the curves are all based on the same variance, 
their intersection is the same as the intersection of the leftmost and rightmost curves.° Hence, in order to derive pjx it is only 
necessary to determine the area of the intersection of the curve for the predicted winner and either j or k, whichever is predicted 
to obtain fewer votes. 

Let V; denote a random variable that represents the vote share for candidate j. Constructing a normal curve around point s 
means that the mean of V; is s;, and the density of V; is given by 


ee (11) 


where 07 is the variance of Vj. It is best interpreted as measuring voters' degree of confidence in their signals. The densities of V; and 
Vmax are similar. Let i, denote the point of intersection. At this point: 


1 (ip = Sabie 1 (ip = si)" 
ex = ex . 
V2n0 . ( 20? V2n0 : 20? 


fy) = 


© Fig. 3 displays the densities of the random variables as if the distributions were truncated. It is obvious that this is done merely to show the logic of the model 
more clearly. The actual distributions are not truncated. 
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If one of the two candidates for whom a pivot probability is being derived is expected to obtain most votes, i, can be derived 


from this expression, but as Cranor argued, the point of intersection between sax and the smaller of s; and s;, provides a more 
general expression: 


; 2 2 
kas (ip —Smax)\__ 1 — (ip = Sing)” 
V210 20 V/2no 202 


The point of intersection ip is thus: 


2 2, 
Smax — $ 


. min(j,k) 
ib =59-———_ +. (12) 
2 (Smax = Smin(ik) ) 


The pivot probability is then 


2 i "°F a | 7 Pay (13) 
= 2a 20 ’ 
” 210 J—s0 V2n0 Ji, 
Using the smaller of s; and s; again provides a more general expression: 
2 2 
1 b (w-smax)2 al °° (Sprint) ) 
ik == e 22 dw+ | e 2 dv. 14 
ee Taal. V2n0 Jip 4) 


Since the normal curves have the same variance 0%, 


2 [ __(v-smax)? d ( 1 5) 
k= — e 202 Vv. 
Pi V 210 J—« 
Let u = “*« so that dv=odu. When v=-, u=-~, and when v=ip, u = = After this change of variable, pj, can be expressed 
as 
2 i 24 ae 
ik = ——= e zdu. 
Ph V 20 J —20 


The pivot probability pj, is thus given by the standard normal distribution function @: 
Pik = 20(2— ine), (17) 
o 


It is clear that the voters do not explicitly compute their pivot probabilities with the formal precision described above. The point of 
the signal model is rather to provide a realistic account of voters' beliefs concerning the relative winning chances of the various 
candidates. The idea is to characterise those beliefs in terms of the reliability of the signals (the ‘quality’ of voters’ information) and 
voters’ confidence in them, by varying the corresponding parameters p and o. The model thus allows modeling beliefs that range from 
highly accurate to highly inaccurate, and at the same time taking voters' confidence in the quality of their information into account. 


4. Simulation and setups 


Asetup is a combination of assumptions used in a set of G=2000 simulated games. Expected utility setups differ with respect to 
the reliability of voters' signals (~), their confidence in the signals (0), and the degree of correlation between voter types and 
preference intensities (C) (see the next paragraph). In uniform setups voters' utilities are drawn from a uniform distribution on [0,1]’, 
while in setups with intensity correlation voter types three and five have systematically higher and types one and six systematically 
lower preference intensities for their second-best candidates x and y respectively. These setups are identical to the corresponding 
uniform setups with respect to all parameters except voters’ preference intensities. In order to generate setups with a correlation 
between this parameter and voter types without affecting the interpersonal comparisons or the preference orderings, the individual 
utilities were derived as follows. 

Uj, U2, and U3 were first generated from the uniform distribution on [0,1] for each voter.® U, and U3 were then used for defining 
the voter's utility scale as the [U3,U;] interval. A voter's utility for his or her middle candidate U; is referred to as the intensity. A 


7 The simulations were thus based on the impartial anonymous culture assumption: each voter type is equally likely. See Regenwetter et al. (2006). 
8 This assumption is tantamount to the impartial anonymous culture (IAC). It makes the simulated elections tighter than most real-world elections are. See 
Regenwetter et al. (2006). 
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Fig. 4. Ultimate efficiencies under AV. 


standardised intensity, 0, expresses what a voter's utility for his or her second-best candidate would be if the scale was the [0,1] 
interval. These standardised second-best utilities are referred to as intrapersonal intensities. The relationship between the 
standardised intrapersonal utility and the original scale of utility is given by 


yg Ur — US 
Me 1 G05" (18) 


In setups with an intensity correlation, these standardised intensities were multiplied by a parameter C, 0.5<C<1 for those who 
put y second (voter types one and six) so that the new correlated intensities US! and US° were given by 


~C ~ 
U5 = CUS, (19) 


In order to compensate for the decreases in utility for voter types one and six, the intensities for voters of types three and five 
(i.e. for x) were given by 


mie a 
U5 212€U, (20) 


These adjustments made the average utilities for x higher and the average utilities for y lower than in the uniform setups, while 
keeping the overall average utility fixed.° In uniform setups, C=1. C thus denotes the degree of correlation between preference 


intensities and voter types. 
These standardised intensities were then scaled back into the original [U3,U;] utility scale. Let U,* denote a voter's correlated 
intensity expressed in terms of the original [U3,U;] scale. U3 is given by: 


* ~C 
U, = U3 +U5(U; — U3). (21) 
5. Simulation results 


5.1. Reasonable parameter ranges 


The purpose of the simulations was to study how the various parameters affected utilitarian efficiencies. Although the choice of 
a reasonable range of values for these parameters is somewhat arbitrary, arguments in favour of certain ranges are given below. 
If the ‘observed’ vote share of candidate j was only determined by random perturbances, it would be given by pR;. The variance of 
this expression is p?. Since the vote shares are given by Y= if and nj is the sum of N independent Bernoulli trials with a success 
probability of },'° the variance of n; is } (1 — $)N = §N, and that of vis NN = 4. Ifrandomness affects the vote shares about as much 
as the realised profile of voter types, p? = ~ so that p = im which is about 0.0105 with 2001 voters. This number provides a fairly 


natural maximum value for p. 


° Note that the utility for the second-best candidate in uniform setups is 1 - US rather than US. Since US is drawn from a uniform distribution on [0,1], it does 
not matter which one is used. 
10 This follows from the IAC assumption. 
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Fig. 5. Utilitarian efficiencies under the plurality rule with o=p. 


Voters are not assumed to know the exact value of p. They merely know that their signals on vote shares are based on a normal 
distribution with some variance o~. The standard deviation of the difference between the highest and the lowest vote shares (std 
(Smax~ Smin)) With 2001 voters is 0.0112. This provides a fairly natural maximum value for o.'' Another way of looking at cis to take 
into account the fact that pre-election polls are often based on a sample size of 1000. If the voters viewed the published vote-share 
figures as resulting from Bernoulli trials with the aforementioned success probabilities, their standard deviation would be 

dcip00 = 0.0149. Since the reasonable values for cand p turned out to lie in the same region, the simulations were run with 0.001, 
0.005, 0.009, and 0.013 for both o and p. 


5.2. Counterbalancing and preference intensities 


The utilitarian efficiencies when p=o are displayed in Fig. 4.'* UEsy and UEgy stand for utilitarian efficiency under SV and EU 
behaviour, respectively. 

This figure shows that approval voting yields high utilitarian efficiencies irrespective of the behavioural assumption used. As 
expected, these efficiencies increase with an increase in the correlation between voter types and preference intensities, but this 
happens only if voters’ confidence in their signals is low. High utilitarian efficiencies, and their increase with the degree of 
correlation under EU behaviour, can be explained in terms of the counterbalancing of second votes; second votes are given to 
candidates with a relatively high utility. This can be seen from equations 2, 3, and 4. Consider, for example, voters of types three 
(prefer z to x to y) and five (prefer y to x to z). They will give a second vote to x if E,=pxy(U(x))-U(y))+ px(U(x) - U(z))>0. It is easy 
to see that ; ie ;20. Thus, the higher the average utility for x, the more often type-three and type-five voters give a second vote tox. 
A similar aroha shows that suey >0. Hence, the higher the average utility for y, the more often type-one and type-six voters give 
a second vote to y. In setups ‘with a Sirens correlation the average utility for x is high and that for y is low. Utilitarian efficiencies are 
thus high in setups with a strong correlation because many second votes for x are counterbalanced by few second votes for y. 


5.3. A comparison with plurality voting 


Fig. 5 shows utilitarian efficiencies under PV. 

How should we account for the fact that utilitarian efficiencies are higher under AV than under PV? If a voter believes that his or 
her most preferred candidate is likely to be in a close race with any of the other candidates, a strategically behaving voter gives a 
vote only to his or her most preferred candidate, engaging in ‘bullet voting’. Strategic calculations under AV may thus entail that a 
voter does not give a vote to a fairly intensively preferred candidate to whom he or she does give a vote under SV behaviour. Voters 
give a second vote only if they believe that their most preferred candidate does not have a chance of winning the election. The 
circumstances under which voters give a vote to two candidates rather than to one under AV are thus similar to the circumstances 
under which they give a strategic vote under PV: only voters who believe that their most preferred candidate does not have a 
chance of winning vote strategically under PV, and those same voters may have an incentive to give a second vote under AV. The 
fact that these circumstances are similar may account for the fact that utilitarian efficiencies are not lower under AV than under PV, 
but it does not explain why they are higher than under PV. 


" Cranor (1996, p. 92) uses the values 0.001 and 0.05 for 07. 
12 The full sets of data are available from the author on request. 
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Fig. 6. Utilitarian efficiencies under AV when voters' signals do not contain intensity information. 


There are three behaviour-related differences between AV and PV. The first is that under PV, but not under AV, strategic 
behaviour may entail abandoning the most preferred candidate (Brams and Fishburn, 1983, p. 69). AV is more flexible than PV in 
the sense that voters may vote for one or two candidates. The second is that although the decision to give a strategic vote under PV 
and the decision to give a second vote under AV are made on the basis of the same equations, these equations are used differently. A 
second vote is given under AV if the expected gain from doing so is positive, but a strategic vote is given under PV if the expected 
gain from giving the vote to a second-best candidate is higher than the expected gain from giving it to the best candidate. The third 
difference is that since voters have been assumed to obtain perturbed information on how other voters' would vote sincerely, 
voters’ signals contain information concerning preference intensities under AV, but not under PV. 

It is important to try to determine which of these differences is crucial because the assumptions upon which these possible 
explanations depend are not equally plausible. If the second difference is the important one, AV really does better than PV with 
respect to utilitarian efficiency under reasonable assumptions. However, if it is the first, many voters must be ‘mistaken’ about 
which candidate is likely to win under PV. Being forced to abandon the most preferred candidate in voting strategically may 
decrease utilitarian efficiency under PV if voters mistakenly abandon a candidate who would in fact have won had they not voted 
strategically. Voters give a second vote under AV when they believe that their most preferred candidate does not have a chance of 
winning, but since they are allowed to vote for their most preferred candidate as well, misjudging the probabilities may be less 
costly than under PV. Before trying to evaluate whether this conjecture is true, it is necessary to give PV a fairer treatment by 
eliminating some unrealistic strategic voting. 

All voter types may have an incentive to vote strategically even in setups with a strong correlation between voter types and 
preference intensities. Hence, under PV, even some type-one voters vote strategically for y in these setups. Note that since voters’ 
signals are based on perturbed information concerning the number of voters who put a given candidate first, these signals do not 
take preference intensity information into account in any way. Hence, even though x is always the utilitarian winner in setups with 
a strong correlation, voters quite often believe that he or she does not have a chance of winning. Even though the logic of 
counterbalancing implies that x obtains more strategic votes than y, this difference is often not sufficient to make x win. This logic 
also implies that if voters' decisions to vote strategically are driven by their beliefs rather than by their preference intensities, 
strategic behaviour may result in relatively low utilitarian efficiency. 

This, however, is what seems to happen under PV, given how the signals have been modelled: voters often give a strategic vote 
mainly because their expected gain from a sincere vote is exactly zero rather than very small. For example, if probabilities p,, and 
Pxz are exactly zero, the expected gain from voting for x is zero, and a type-one voter gives a strategic vote to y if py, is different 
from zero, even if this probability is 0.00001, and even if the intrapersonal preference intensity for y is 0.00001. It is clear that this 
belief-driven kind of strategic voting decreases utilitarian efficiency, and that it is not plausible that real people vote in this way. It is 
not plausible to assume that a voter gives a second vote or a strategic vote to the second-best candidate if his or her intrapersonal 
utilities are 1, 0.00001, and 0 for the best, the second-best and the worst candidate, respectively. If a voter thinks that his or her 
second-best candidate is just barely better than the worst candidate, he or she is not likely to vote strategically. After all, Ralph 
Nader did obtain votes in the 2000 US presidential elections, and it is surely not reasonable to explain all these votes, even in 
electoral districts in which the race between the major candidates was tight, by arguing that his supporters were irrational because 
they did not engage in strategic voting.” 


'3 It is clear that some people may also have supported Nader because they wanted to cast protest votes. See e.g., Burden (2005) for an account of these 
elections. 
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Fig. 7. Utilitarian efficiencies under PV with full intensity information in signals (A=0). 


5.4. The consequences of intensity information in the signals 


In an earlier version of this paper, voters were assumed to obtain perturbed information merely on other voters' best candidates 
under AV. Voters’ pivot probabilities and preferences are exactly the same under PV and AV under these assumptions. The results 
from such setups are presented in Fig. 6. 

A comparison of Figs. 4 and 6 shows that intensity information decreases the utilitarian efficiencies: when the signals only contain 
perturbed information on the most preferred candidates, the utilitarian efficiencies are higher! This result may seem counterintuitive 
at first because ‘better’ information leads to lower utilitarian efficiency, but it may be explained as follows. When the signals do not take 
intensity information into account, voters of types three and five very often give a second vote to their second-best candidate x. For 
example, with C=0.5 and p=0=0.013, about 87 per cent of these voters give a second vote, while y obtains second votes from only 
about 12 percent of type-one and type-six voters, and z obtains second votes from about 50 per cent of type-two and type-four voters. 
These differences derive purely from the differences between preference intensities for the various candidates. 

However, when voters' signals contain intensity information, as C decreases, almost all voters of types two and five vote 
strategically because they no longer believe that y, their most preferred candidate, has a chance of winning. But since they vote 
mainly because of this belief, counterbalancing no longer functions properly between these two voter types: almost all type-five 
voters vote for x, but at the same time, almost all type-two voters vote for z. Hence, the difference in their preference intensities for 
x and z does not show in their behaviour. At the same time, type-three voters give significantly fewer second votes to x because 
they (correctly) conceive of the election as a close race between x and z. For these reasons, z rather than x is sometimes selected in 
these setups, making utilitarian efficiencies lower. The more confident type-two voters are that y does not have a chance, the less 
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Fig. 8. Utilitarian efficiencies under PV with a modicum of intensity information in signals (A=0.8). 


698 A. Lehtinen / European Journal of Political Economy 24 (2008) 688-704 


é 3 ? T T T T 
[Lo 
or 
> UE, (P=0.001) 
uf 70+ |. -uE,,(p=0.008) 
2 —+— VE, ,,(p=0.009) 
ut gol |e UEcu(e=0.013) 
5 
50 
40 
30 
05 055 06 065 07 O75 08 085 09 095 1 
Cc 


Fig. 9. Utilitarian efficiencies under PV with A=0, and T=0.2. 


their intensities affect their decision and the more their beliefs do. When voters are less confident, there are more type-five voters 
who give a second vote to x than type-two voters who give a second vote to z. 

We could conclude that the high utilitarian efficiencies under AV are not mainly due to the fact that voters obtain information 
on preference intensities under this rule. The main difference does not derive from this difference in the content of signals. On the 
other hand, it is now seen that utilitarian efficiencies are indeed higher under PV if voters obtain perturbed information on 
intensities. 

There are reasons for why voters obtain, or at least should try to obtain, information on preference intensities under PV. Voters 
must be assumed to obtain some information on the aggregate intensities under PV for the following reasons. First, as noted in the 
previous section, thus far the signals under PV have been based only on preferences for the most-preferred candidates, and we 
know that this leads to unrealistic behavioural assumptions. Voters may give a second vote to a candidate for whom their 
intrapersonal intensity is only 0.0001 if the two pivot probabilities for the best candidate are both exactly zero. Secondly, if voters 
are able to take other voters' strategic behaviour into account, they do this by assuming that candidates with high average utility 
will obtain more votes than information on mere preference orderings would imply. 

Although there is no single unambiguous source of intensity information in the real world, it is plausible to assume that voters 
have some indirect knowledge about preference intensities. The support of some candidates, for example, may be geographically 
limited, and there may be fierce opposition elsewhere. The fact that a candidate's views are radical or modest also provides some 
clues. 

Let U denote the sum of utility for all candidates, and U(j) the sum of utility for candidate j. Let \=[0.1] denote the relative 
share of intensity information in the signals. A composite signal consists of a combination of preference and intensity information, 
and a random term: 


U(j) 
U 


Sij — Nj + (1 = 2) + pRi, (22) 


where R; and p have the same interpretations as before.'* When A=1, the pivot probabilities are based only on information on 
preference orderings. The results of simulations under PV are shown in Figs. 7 and 8. 

Under PV the utilitarian efficiencies are considerably higher in setups with full intensity information (A =0) than with just a little 
intensity information (A=0.8), and they are similar to the results under AV in the former. This is because the voters no longer give 
strategic votes to y just because they believe that x or z does not have a chance of winning in setups with a strong correlation. 

It has been shown that full intensity information may result in almost all type-five voters voting for x, but almost all type-two 
voters voting for z. As the logic of counterbalancing suggests, this phenomenon should disappear if strategic votes are given only if 


4 This way of modeling intensity information implicitly assumes that voters have information on interpersonal comparisons of utilities. Note, however, that 
even though voters are assumed to obtain perturbed signals of the sums of utilities, it is not necessary to assume that they make (perturbed) interpersonal 
comparisons of utilities. To see why, consider Myerson's (1985) argument about those comparisons: there are no unambiguous choices that reveal information on 
interpersonal comparisons. However, voters are assumed to use the perturbed sums of utility as proxies for predicting other voters' choice behaviour. Therefore, it 
would be possible to model intensity signals in such a way that voters obtain perturbed information on each voter's preference intensity separately. However, it is 
not plausible to assume that they obtain such information on the individual intensities, beliefs, and behavioural dispositions of thousands of heterogeneous 
voters. It is more plausible to assume that the voters have perturbed aggregate-level intensity information rather than individual-level information. Hence, if we 
are concerned that the voters are not able to observe information on aggregate intensities in mass elections, the problem is not realistically remedied by using 
individual signals. One would rather have to concede that voters cannot realistically take game-theoretical considerations or intensities into account at all in this 
context. 
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Fig. 10. Utilitarian efficiencies under AV with T=0.2. 


voters' intrapersonal intensities for the second-best candidates are higher than a threshold value T. The previous simulation results 
were implicitly based on a threshold of T=0. However, it is rather unrealistic to assumed that real people act this way. Fig. 9 shows 
the utilitarian efficiencies under PV when the threshold is T=0.2. 

This figure shows that removing only the most extremely unrealistic strategic voting also makes utilitarian efficiencies very high 
under PV. A similar effect also occurs under AV (see Fig. 10). 

Full intensity information together with an intensity threshold for giving strategic or second votes thus makes the utilitarian 
efficiencies very high under both AV and PV. The rest of the small difference in utilitarian efficiencies between the two rules may 
be attributed to the fact that the decision to vote for a second candidate is based on positive expected gain, but the decision to 
vote strategically is based on higher expected gain. Given that this remaining difference is rather small, most of the difference in 
the initial setups shown in Figs. 5 and 6 may be attributed to the fact that strategic behaviour was not realistically modelled. 
However, given that utilitarian efficiencies are clearly lower under PV if the signals do not contain intensity information even if 
an intensity threshold is used (see Fig. 11), AV could be said to be more robust than PV with respect to the informational 
assumptions. 


5.5. Bullet voting 


If all voters engage in bullet voting, AV reduces to plurality voting with SV behaviour (Saari, 2001), and the utilitarian 
efficiencies are fairly low. I will now show, however, that if voters engage in strategic behaviour, all voter types should engage in 
bullet voting only if they have unrealistically high confidence in their signals. With realistic confidences, there are always some 
voters who do not have an incentive to engage in bullet voting. 
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Fig. 11. Utilitarian efficiencies under PV with A=1 and T=0.2. 
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Fig. 12. Utilitarian efficiencies under AV with 21 voters. 
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Fig. 13. Condorcet efficiencies under AV. 


The results for o=p=0.001 under AV stand out as different from those derived with other parameter values. Utilitarian efficiencies 
are relatively low when voters' degree of confidence in their signals is high (ois small). This can be explained as follows. In setups 
with o=0.001, the variance of the term Sa is very high. For example, with o=0.001 it was 0.3461, but with 0=0.005, it is already 
as low as 0.0138. Large values of i Smax often render zero pivot probabilities, and thereby lessen the total number of votes given to 
second-best candidates. For example, in setups with C=0.5 and o=p=0.001, the average percentage of voting games in which all 
three probabilities were exactly zero was 42.27 and the average percentage of voting games in which all three probabilities were 
nonzero was 16.08. Changing o to just 0.005 changes these percentages to 0 and 99,32, respectively. When all pivot probabilities 
are zero, all expected utilities (E‘, E,, and E‘) are zero, and voters only give a vote to their most preferred candidate, thereby 
engaging in bullet voting.'® Voters thus give a vote only to their most preferred candidate because they believe that their vote has 
no chance of affecting the results. 


15 Part of the explanation for this dramatic difference may derive from the fact that the probabilities in the simulation model were not calculated using the 
function (Anordf in the IMSL library) that yields probabilities from a normal distribution function, every time a probability was to be computed. The probability 
values were fetched from a pre-existing table. It was necessary to construct the code in such a way because using the Anordf-function every time a probability 
was to be calculated would have made running the code prohibitively time-consuming. Since the preexisting table must be finite, the probability that it yields 
must be set to exactly zero (0 rather than e.g. 0.00000001) at some point. This cutoff point was set to 8. For all values of Sma larger than this, the function thus 
yielded the probability zero. It is also worthwhile noting that a fixed table is limited in its capability to distinguish between different extremely small 
probabilities. It contained 8001 entries. It is advisable to be cautious in interpreting the results derived with o = 001 due to these computational limitations. 


A. Lehtinen / European Journal of Political Economy 24 (2008) 688-704 701 


With 2001 voters, o=0.001 is clearly unrealistically low. As argued above, if the voters’ signals were derived from polls, it would be 
reasonable to use a fifteen-fold standard deviation. The results were derived for all possible parameter value combinations for 0 
and p. They are not displayed here, however, because all the relevant information concerning the confidence in the signals and 
their reliability is already contained in Fig. 6: all that matters is whether o is very small or not. If it is not, utilitarian efficiencies are 
very high, irrespective of the other parameter values. 


5.6. The size of the electorate 


What happens if the size of the electorate is changed? Changing it upwards is not computationally feasible but a smaller 
electorate can of course be studied. The size of p has to be changed however. With N=21, for example, p = \/ = 0.1029. The 
reasonable values for p and o are thus about tenfold compared to the previously used ones. Fig. 12 shows the results for a small 
electorate under AV. 

Strategic behaviour is more clearly welfare-increasing in small electorates but utilitarian efficiencies are lower throughout the 
range of setups. Why, then, do utilitarian efficiencies increase with the size of the electorate? My main conjecture is that 
idiosyncratic differences in voters’ utilities and beliefs may play a disturbing role in small electorates, but their importance 
diminishes in larger electorates as long as there are systematic differences in aggregate utilities for the candidates. To be more 
precise, it is possible that the differences in utilities are not fully reflected at the level of aggregate votes in small electorates. In 
large electorates, random variations in beliefs cancel each other out, as long as there is no reason to expect the beliefs to be 
systematically distorted, but the intensity differences will remain. There is thus reason to believe that with a modicum of 
correlation between preference intensities and preference orderings, the utilitarian efficiencies will approach 100 per cent as the 
size of the electorate increases, even if the voter types are generated with an impartial culture. 


5.7. Condorcet efficiencies 


The utilitarian efficiencies are relatively low in setups with o=0.001 because the voting outcomes are similar to those that 
would have been obtained if all voters voted sincerely under the plurality rule. This conclusion may be verified by considering 
Condorcet efficiencies, see Fig. 13. 

For obvious reasons, Condorcet efficiencies are relatively low when the correlation between voter types and intensities is 
strong; approval voting, under both SV and EU behaviour, is more responsive to preference intensities than to preference 
orderings.'° However, when 0=0.001, this is not the case, and the Condorcet efficiencies correspond exactly to those under the 
plurality rule if all voters engage in SV behaviour (See Fig. 12 in Lehtinen, 2006b). 


6. Robustness with regard to interpersonal comparisons 


How should these results be interpreted? Strategic voting and second votes typically increase utilitarian efficiency but decrease 
Condorcet efficiency. I am willing to argue that if the utilitarian winner and the Condorcet winner are not one and the same 
candidate, then the utilitarian winner ought to be selected. Since voters show by their behaviour that preference intensities are 
important, it would be odd to deny their normative importance in evaluating the candidates. 

The main argument against utilitarian efficiency is that it is impossible to observe what the sum of the utility is because it is 
impossible to obtain exact information on interpersonal comparisons. | will now show, however, that the consequences of strategic 
voting and behaviour are highly robust with respect to different interpersonal comparisons. If the results are similar irrespective of 
the interpersonal comparison used, then they do not depend crucially on them. 

The justification for using preference intensities in evaluating the candidates relies on two normative arguments. The first is 
that the intensities are normatively important, and the second is related to the one-man-one-vote principle: there must be some 
limit to the variation in individual utilities. 

The utility scales were defined by taking the highest and the lowest values of three randomly drawn numbers from the [0,1] 
interval rather than by simply drawing two numbers. Using just two numbers would imply that some voters might weigh 
a hundred times more than some others in the sum of utilities even though their decisions are just as important as everybody 
else's. This would be the case if the scale for one individual was, say, [0.50, 0.5001] and the scale for another was, say, 
[0.01, 0.995]. 

Several different variations in utility scales will thus be tried in order to see whether the results are robust with respect to 
different interpersonal comparisons. In order to retain comparability with previous results, all these variations need to change the 
utility scales without changing the preference orderings, the intraindividual preference intensities, or the average utility of all 
candidates. It is thus necessary to hold the parameters that determine individual behaviour fixed in evaluating robustness to 
interpersonal comparisons. 

The utility scales must be changed in such a way that they are systematically different between different voter types. The 
utilities of voters of types one, three, five and six were thus changed. The average utility for each voter type was retained, but 
the utility scale, i.e. the difference between the maximum and minimum utilities, was made smaller (larger) for voters of types 


16 These results could be compared to those obtained by Felsenthal, Maoz and Rapoport (1990). See also Felsenthal and Maoz (1988). 
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Fig. 15. Utilitarian efficiencies in the negative correlation setups. 


one and six, and that for voter types three and five was made larger (smaller). The utility scales of those who put candidate y 
second were thus shrunk and the scales of those who put candidate x second were stretched. Given that in setups with 
correlated intensities the intensities for x are on average higher than those for y, this effectively diminishes the importance of 
those who put y second and increases the importance of those who put x second. This variation in interpersonal comparisons is 
henceforth referred to as the ‘mutually reinforcing correlation setup’ because the intrapersonal intensities are high, on average, 
for the same voter types whose interpersonal intensities weigh most in the sum of utilities. A second variation reverses the 
interpersonal correlation but retains the intrapersonal correlation by stretching the scales for voters of types one and six, and 
shrinking those for voters of types three and five. The second variation is henceforth referred to as the ‘negative correlation 
setup’. 

Let IPC denote a parameter that reflects how much voters’ scales are shrunk or stretched.”” The original utilities are Uj, U> ,and 
U3. Let U; and U3 denote the maximum and minimum utilities for voters of types one and six after their scales have been shrunk 
(U,<U, and U3>U3). Since the idea is to subtract as much from U; as is added to U3, U3-U3=U -U;. U; (and U3) is obtained by 
adding to (subtracting from) the midpoint of the utility scale wes a part of the individual's scale Pee“) so that 


U, + U3 + IPC - (U, — U: 
Ui! 1 3 5 ( 1 aM) (23) 


The formal analysis in this section closely follows Lehtinen (2007b). 
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and 


— — [U, + U3 — IPC - (U; — Us)] 
2 


=U; +U3— Uy. (24) 


Similarly, let U; and U3 denote the maximum and minimum utilities for voters of types three and five after their utility scales 
have been stretched. The idea now is to add as much, on average, to U; as was subtracted from voters of types one and six. Thus, the 
difference between “> and #(\4'-"s) js added to the original U; so that 


= (1 — IPC)(U; — Us) 


U; =U, + 5) =2-U,-U,. (25) 


Again it is required that U;-U 3=U,-U;, so that 


Us = U3 —U, + U; = U3 — U, + U4. (26) 


What remains is to rescale the interpersonal intensities in such a way that their intrapersonal relative values remain unchanged. 
Let Ushrmk and usec denote these two intensities. Then 


psbrink — 7, 4 U5 (U, = Us), (27) 
and 


ugrerh = U, + U2 (Ui — U3). (28) 


In the mutually reinforcing correlation setup voter types one and six have utilities (U,, US", U3), voter types three and five 
have utilities (U;, US"°*", U3), and voter types two and four have utilities (U;, U2, U3). 

The utilitarian efficiencies from these two setups are displayed in Figs. 14 and 15. 

These figures show that utilitarian efficiencies are indeed affected by the interpersonal comparisons, but that the consequences 
of strategic behaviour remain relatively beneficial irrespective of them. The results are thus robust with respect to interpersonal 
comparisons. 


7. Conclusions 


Approval voting yields relatively high utilitarian efficiencies irrespective of the behavioural assumption used. The beneficial 
welfare implications of this voting rule are thus relatively robust with respect to the voters' behavioural dispositions. 

The findings confirm that the preference-intensity argument for AV is valid: approval voting reflects preference intensities rather 
well, and better than other voting rules. However, once a model of strategic behaviour under PV is realistic enough, this voting rule 
also reflects intensities very well. AV yields higher utilitarian efficiencies than PV particularly if voters’ signals do not contain any 
intensity information. However, if voters do obtain intensity information and if they vote strategically only if their preference- 
intensity for their second-best candidate is higher than a fairly low threshold value, both rules yield very high utilitarian efficiencies. 

The social choice literature commonly refers to the plurality rule as the worst of the widely used voting rules. Indeed, the 
prevailing knowledge concerning voting rules makes a mystery of its being so widespread. Thus far the only justification for PV 
seems to have been that it is simple. The findings reported here show that this rule actually performs very well, at least if 
performance is evaluated in terms of utilitarian efficiency. On the other hand, approval voting performs even better because 
utilitarian efficiencies are often close to one hundred per cent under this rule. It is thus to be considered a serious candidate when 
electoral reforms are contemplated. 

The methodological importance of these findings lies in shifting the focus of analytical discussions. Most of the existing 
literature on strategic voting concerns attempts to find voting rules that are less susceptible to strategic voting, and to compare 
them in terms of how prone they are to strategic manipulation. Since the findings indicate that strategic behaviour is beneficial, the 
important question is rather to compare various voting rules with respect to how beneficial strategic behaviour is. There are 
important differences between the various voting rules: AV and PV seem to be the top two in this respect (compare the results 
presented here to those in my earlier papers). 
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